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Characterizing and modeling cyclic behavior in non-stationary time series through 

multi-resolution analysis 
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A method based on wavelet transform and genetic programming is proposed for characterizing and 
modeling variations at multiple scales in non-stationary time series. The cyclic variations, extracted 
by wavelets and smoothened by cubic splines, are well captured by genetic programming in the form 
of dynamical equations. For the purpose of illustration, we analyze two different non-stationary 
financial time series, S&P CNX Nifty closing index of the National Stock Exchange (India) and 
Dow Jones industrial average closing values through Haar, Daubechies-4 and continuous Morlet 
wavelets for studying the character of fluctuations at different scales, before modeling the cyclic 
behavior through GP. Cyclic variations emerge at intermediate time scales and the corresponding 
dynamical equations reveal characteristic behavior at different scales. 

PACS numbers: 05.45.Tp, 89.65.Gh 



I. INTRODUCTION 

A number of non-stationary time series are known to 
comprise of fluctuations having both stochastic and cyclic 
or periodic components. Isolating fluctuations from these 
time series, at different scales for the purpose of char- 
acterization and modeling is a research area of signifi- 
cant interest [3, [3, 0] • Here we explicate a wavelet based 
approach for separating structured variations from the 
stochastic ones in a time series before modeling them 
through genetic programming (GP). For the purpose of 
illustration we have chosen two financial time series, since 
the same is well-known to exhibit random and structured 
behavior at different scales EH. At small time scales, 
the fluctuations are primarily stochastic in nature; at 
higher scales the random part is averaged out and char- 
acteristic nature of the variations become transparent. 
For a reliable analysis one also needs to have a reason- 
ably long data set for which the financial time series are 
well suited. The chosen time series are S&P CNX Nifty 
closing index of the National Stock Exchange (India) and 
Dow Jones industrial average closing values, representing 
two sufficiently different economic climates so as to bring 
out the efficacy of the present approach. 

Study and characterization of fluctuations in financial 
time series have been studied through a variety of ap- 
proaches. For example, variations have been analyzed 
through Levy-stable non-Gaussian model @. Stochastic 
nature of the high frequency fluctuations and presence 
of structured behavior have emerged through study of 
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these time series using random matrix theory. In partic- 
ular, analysis of the cross-correlations between different 
stocks 0, H[ reveal universal and non-universal phenom- 
ena. The latter ones indicate correlated behavior between 
stocks of different companies. This behavior can manifest 
in the composite stock price indices, where the correlated 
behavior of several companies can give rise to structured 
or cyclic behavior in appropriate time scales. Wavelet 
transform, because of its multi-resolution analysis prop- 
erty is well-suited to isolate fluctuations and variations 
at different scales d E3] • 

The goal of the present article is to demonstrate the 
usefulness of combining wavelet transform with tools like 
genetic programming for modeling of fluctuations. We 
carry out our analysis on two different financial time 
series, in order to find out the similarities and differ- 
ences between them, from the perspective of fluctuations. 
Apart from sharp transients representing sudden varia- 
tions ascribable to physical causes, the high frequency 
fluctuations at small scales are primarily random in char- 
acter. In all the time series, cyclic behavior emerges 
at higher scales, when the random fluctuations are av- 
eraged out. The physical nature of the cyclic phenomena 
is substantiated through both discrete and continuous 
wavelets. In case of continuous wavelets, the scalogram 
clearly reveals cyclic behavior at intermediate scales. We 
then proceed to model these variations through Genetic 
Programming (GP) [HI, El El El El- For that pur- 
pose, we smoothen the cyclic behavior at every scale, 
corresponding to different levels of wavelet decomposi- 
tion, through a cubic spline. It needs to be mentioned 
that, since the purpose is to model cyclic behavior, physi- 
cally it is meaningful to smoothen the same, before trying 
to model them. 

We study the fluctuation characteristics of two differ- 
ent financial time series, S&P CNX Nifty closing index 
of the National Stock Exchange (India) and Dow Jones 
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2048 Daily Observations: 24th Dec 1997 - 16th Feb 2006 

FIG. 1: S&P CNX Nifty closing index data having 2048 points 
covering the daily index lying within 24-Dec-1997 to 16-Feb- 
2006. 
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FIG. 3: Haar wavelet low pass coefficients for level 8 for Nifty 
data. As expected, these coefficients resemble the average 
behavior of the time series. 
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FIG. 2: Haar wavelet fluctuation coefficients for levels 1 to 
8 for Nifty data. Transient and stochastic behavior at small 
scales and ordered variations at higher scales are evident. 
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FIG. 4: Normalized power of Haar wavelet coefficients for dif- 
ferent levels 1 to 8 for Nifty data. Indication of rapid increase 
from 6 th level is clearly visible. 



well. The dynamical equations are primarily linear with 
nonlinear additive terms of Pade type. These equations 
are checked for their predictive capabilities by making 
out-of-sample predictions. One-step out-of-sample pre- 
dictions are made which use given time lagged values 
successively and predict the next data set value. It is 
found that the one-step predictions are very good. 



industrial average closing values, through wavelet trans- 
forms belonging to both discrete and continuous families. 
Haar and Daubechies-4 (Db4) from the discrete wavelet 
family and the continuous Morlet wavelet are used to an- 
alyze the time series. As has been observed earlier, at 
small scales, the fluctuations captured by the wavelet co- 
efficients exhibit self-similar character Clear cyclic 
behavior emerges in medium scales and is evident from 
both discrete and continuous wavelet analysis. It is found 
that, GP captures the cyclic behavior at each scale quite 



The paper is organized as follows. In Section II, we 
give a brief introduction to wavelets before carrying out 
wavelet decomposition of both the data sets considered 
to study the character of the variations at different scales. 
Cyclic variations at different scales is extracted through 
Daubechies-4 wavelet and confirmed by continuous Mor- 
let wavelet. We then proceed to model the cyclic phe- 
nomenon through GP in section III and conclude in sec- 
tion IV, after pointing out a number of applications and 
future directions of work through the present method. 
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FIG. 5: Normalized power of Haar wavelet low pass coeffi- 
cients for different levels I to 8 for Nifty data. One observes 
decrease in low-pass power from 6 th level onwards. 
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FIG. 7: Scalogram of Morlet wavelet coefficients for scales 
101-200 of S&P CNX Nifty closing index values. 
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FIG. 6: Db4 Wavelet coefficients for different levels 1 to 8 for 
Nifty data. Cyclic behavior at intermediate scales are well 
captured by the wavelet coefficients. This behavior is present 
both at local and global levels. 



II. WAVELET TRANSFORM 

Wavelet transform provides a powerful tool for the 
analysis of transient and non-stationary data and is par- 
ticularly useful in picking out characteristic variations 
at different resolutions or scales jl7| . In the context of 
financial time series [HI, [l9| , it has found extensive ap- 
plications. It has been used for the study of commodity 
prices [20], in measuring correlations (2l| , in the study 
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FIG. 8: Sum of wavelet coefficients for Nifty data over all 
scales as a function of time. An approximate periodic behav- 
ior with periodicity of about 200 trading days is evident. 



of foreign exchange rates [22j and for predicting stock 
market behavior |23l . [2~j |. to name a few. This linear 
transform separates a data set in the form of low-pass 
or average coefficients, resembling the data itself, and 
wavelet or high-pass coefficients at different levels, which 
capture the variations at corresponding scales. Wavelets 
can be continuous or discrete. In the latter case, the 
basis elements are strictly finite in size, enabling them 
to achieve localization, while disentangling characteristic 
variations at different frequencies. 

In discrete wavelet transform (DWT), the construc- 
tion of basis set starts with the scaling function ip(x) 
(father wavelet) and the mother wavelet ip(x), whose 
height and width are arbitrary: J ipdx = A, J ipdx = 
0, J tpipdx = 0, / \<p\ 2 dx = 1, J \ip\ 2 dx = 1, where A is 
an arbitrary constant. The scaling and wavelet functions, 
and their scaled translates, known as daughter wavelets, 
rpj k = 2 J l ' 2 f/ , (2 J x — k), are square integrable at different 
scales. Here, k and j respectively are the translation and 
scaling parameters, with — oo < k < +oo. Although con- 
ventionally, one starts with the scale value j = 0, one 
can begin from any finite value j' and increase it by in- 
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tegral units. The original mother wavelet corresponds to 
■00.0 ■ The daughter wavelets are of a similar form as the 
mother wavelet, except that their width and height dif- 
fer by a factor of 2 J and 2 J / 2 respectively, at successive 
levels. The translation unit k/2 J is commensurate with 
the thinner size of the daughter wavelet at scale j. In 
the limit j ' ^ oo, these basis functions form a complete 
orthonormal set, allowing us to expand a signal f(t) in 
the form. 
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+00 



/(*) = c 3,Wj,k{t) + Y d r ,ki>y At) (i) 



fe=-c 



fc=-oo j'>j 



Here, Cj-fc's are the low-pass coefficients and cL^'s are 
the high-pass or wavelet coefficients. They respectively 
capture the average part and variations of the signal 
at scale j and location k. For the discrete wavelets, 
the property of multi-resolution analysis (MRA) leads 

to Cj,k = J2n h i n - 2fc)Cj + i,„, dj <k = J2 n K n - 2k)c j+liTl , 

where h(n) and h(n) are respectively the low-pass (scal- 
ing function) and high-pass (wavelet) filter coefficients, 
which differ for different wavelets. Both low-pass and 
high-pass coefficients at a scale j can be obtained from 
the low-pass coefficients at a higher scale (c J+ i.„). This 
implies that, starting from the finest resolution of the 
signal, one can construct both scaling and wavelet coef- 
ficients, by convolution with the filter coefficients h(n) 
and h(n). 

For the Haar wavelet: h(0) — h(l) — and h(0) = 
—h(l) = -j-. Haar basis is unique, since it is the only 
wavelet, which is symmetric and compactly supported. 
In a level one Haar wavelet decomposition, the level-I 
low-pass (average) and high-pass (wavelet or detail) co- 
efficients are respectively given by the nearest neighbor 
averages and differences, with the normalization factor 
of A=. In the subsequent step, the average coefficients 
are divided into two parts, containing level-II high-pass 
and level-II low-pass coefficients. The high-pass coeffi- 
cients now represent differences of averaged data points 
corresponding to a window size of two. Wavelets be- 
longing to Daubechies family are designed such that, 
the wavelet coefficients are independent of polynomial 
trends in the data. For example, Daubechies-4 wavelet 
satisfies, J tip(t)dt = 0, in addition to all other condi- 
tions. Because of this the wavelet coefficients here cap- 
ture fluctuations over and above the linear variations. 
For Daubechies-4, h(0) = -h(3) = h{\) = h{2) = 



m = -HD = S=#, fc(3) = -MO) = 
We have used both Haar and Daubechies-4 wavelets 
for isolating these fluctuations at different scales and 
study their character. For continuous wavelet transform 
(CWT), we have utilized Morlet wavelet, whose analyz- 
ing function is given by, 



7r -l/4 e (-^ot-t J /2)^ 



(2) 



The corresponding wavelet coefficients are displayed as a 
function of scale and time in a scalogram. 



In DWT, a maximum of N level decompositions can 
be carried out, when the data set is of size 2 . One may 
choose to have a less number of decompositions than N. 
Often one needs to supplement the data with additional 
points to carry out a N level decomposition. Both in 
DWT and CWT, one encounters boundary artifacts, due 
to circular or other forms of extensions. In our case, for 
minimizing these boundary artifacts, we have used sym- 
metric extension, while studying the behavior of wavelet 
coefficients. The variations at different scales are charac- 
terized by their respective powers, defined as the squared 
sum of the wavelet coefficients at that level. Since in 
wavelet transform power is conserved, the squared sum 
of all the low pass and high pass coefficients add up to the 
squared sum of the data points of the time series, called 
as the total power or energy. We have used normalized 
power which is the power at a given level divided by the 
total power. Periodic extension has been used for ana- 
lyzing the distribution of power at various levels, since 
this extension conserves power. The power plots depict- 
ing high pass power and low pass power clearly reveal the 
character of the fluctuation and the average behavior. 

We start with the National Stock Exchange (NSE) of 
India Nifty daily closing index values, consisting of 2048 
data points, covering the duration from 2A th December 
1997 to 16 th February 2006. This daily index is shown 
in Fig. Q] We carry out an eight level decomposition 
through Haar transform, since the same yields a trans- 
parent picture about the nature of the variations. The 
high pass coefficients are depicted in Fig. [2] Transient 
and cyclic behavior at different scales are clearly visible. 
The corresponding low pass coefficients corresponding to 
8 th level are depicted in Fig. [3] As expected, these low 
pass coefficients capture the average behavior of the time 
series (see Fig. Q]). At finer resolutions corresponding to 
lower level wavelet coefficients, one can clearly see pri- 
marily random nature of the fluctuations. 

The non-random variations significantly increase from 
6 th level onwards. Substantial increase in power at 6 th 
level is also evident in Fig. fj] which shows the behavior 
of the normalized power of the high pass coefficients. The 
normalized power of low pass coefficients shown in Fig. 
[5] start decreasing from 6 th level onwards, since the total 
power is conserved in wavelet transform. It is worth men- 
tioning that the 6 th level high pass coefficients correspond 
to the differences of data points, averaged over a tempo- 
ral window size 32. A few transient phenomena are also 
revealed by these coefficients. After sufficient averaging, 
cyclic behavior is seen to emerge. The averaged low pass 
coefficients reveal a linear trend like the time series in Fig. 
[TJ this can affect the high pass coefficients. In order to 
remove this and capture the characteristic nature of the 
variations, we have carried out decomposition through 
Daubechies-4 wavelets. The structured and cyclic behav- 
ior is transparently demonstrated in Fig. [6l This justifies 
the use of Daubechies-4 wavelets, which removes linear 
trend from the high-pass coefficients. Plots of both high- 
pass and low-pass normalized power reveal that at higher 
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2048 Daily Observations: 22nd Feb 1993 - 25th Jan 2001 

FIG. 9: DJIA closing values having 2048 points covering the 
daily index lying within 22-Feb-1993 to 25-Jan-2001. 



scales fluctuations capture most of the energy; the high- 
pass power increases, while that of low-pass decreases 
progressively. CWT through Morlet wavelets also reveal 
the cyclic behavior at intermediate scales as seen in Fig. 
The scale values indicate the window size of the Mor- 
let wavelet corresponding to the same number of days. 
We depict in Fig. sum over the continuous wavelet 
coefficients over scales as a function of time. An approx- 
imate periodic behavior is seen with a period of about 
200 trading days. The fact that wavelet coefficients con- 
taining both positive and negative values add up to yield 
a periodic behavior indicates the presence of correlated 
behavior. As is clear, purely random un-correlated coef- 
ficients will not lead to this structure. 

We next consider the Dow Jones industrial average 
(DJIA) closing values, shown in Fig. [5] having 2048 
data points for the duration lying within 22 nd February 
1993 to 25 th January 2001. An eight level decomposi- 
tion through Haar transform is then carried out to infer 
about the nature of the variations. The high pass co- 
efficients are depicted in Fig. [TUl Transient and cyclic 
behavior at different scales are clearly visible like the pre- 
vious case of Nifty data. It is to be noted that the 2 nd 
half of the forward wavelet coefficients for each level is 
having higher amplitudes as compared to the first half 
of the coefficients. In this respect, variations of wavelet 
coefficients of Nifty data and DJIA data have different 
characteristics. The corresponding low pass coefficients 
are shown in Fig. [TT] As is evident, these low pass coeffi- 
cients show the trend of the time series. The normalized 
power of the high pass coefficients (Fig. [T2"|) shows a con- 
tinuous rise with levels having a small decline at level 8. 
Correspondingly the low pass coefficients show continu- 
ous decrease in the power (Fig. [T3]). In Fig. [TJ]we show 
the Db4 wavelet coefficients for levels 1 to 8 for the pur- 
pose of comparison with the corresponding Nifty index 
behavior. 

We have studied the behavior of DJIA under CWT. 
The corresponding scalogram is shown in Fig. 1151 Akin 
to the Nifty case one sees cyclic behavior in the scale 
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FIG. 10: Haar wavelet coefficients for levels 1 to 8 for Dow 
Jones data. 
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Low Pass Coefficients (Level 8) 

FIG. 11: Haar wavelet low pass coefficients for levels 1 to 
for Dow Jones data. A linear trend is seen. 



range 100 to 200. The sum of the wavelet coefficients at 
all scales plotted as a function of time (Fig. fl6|) reveal a 
periodic behavior of little less than 200 trading days. In 
this context both the financial time series show similar 
behavior. However the DJIA time series is showing a 
tendency of bursty behavior which is absent in Nifty case. 

Considering the cyclic behavior of variations at inter- 
mediate scales, it is interesting to see how well these 
wavelet coefficients can be analyzed through the tech- 
niques of Genetic Programming in which the model equa- 
tions are built in the reconstructed phase space. This 
modeling can reveal characteristic behavior of fluctua- 
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FIG. 12: Normalized power of Haar wavelet high pass coeffi- 
cients for levels 1 to 8 for Dow Jones data. Significant increase 
in power is seen around 6 th level. 
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FIG. 13: Normalized power of Haar wavelet low pass coeffi- 
cients for levels 1 to 8 for Dow Jones data. Complimenting 
the high-pass behavior, the low-pass power decreases rapidly 
from 6 th level onwards. 



tions at different scales. 



III. MODELING CYCLIC WAVELET 
COEFFICIENTS THROUGH GENETIC 
PROGRAMMING 

In Genetic Programming one assumes the map equa- 
tion relating time lagged variables with the entity X t +\ 
to be of the form [15|, (25| , 



X 



t+i 



f(Xf, X t - T , X t -2r, ■■■X t -(d-l)r) 



(3) 



Here f represents a function involving time series values 
X t in the immediate past and arithmetic operators (+, 
-, x and -=-). The numbers appearing in function f are 
bounded between the range [-N, N], where N is an inte- 
ger number, we have chosen N to be 10. The numbers 
within the above range are generated with the precision 
of 1 digit after decimal point. In the above equation, d 
represents the number of previous time series values that 
may appear in the function and r represents a time delay. 
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FIG. 14: Db4 wavelet coefficients for levels 1 to 8 for Dow 
Jones data. Significant activity is seen in the second half of 
the wavelet coefficients. 
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FIG. 15: Scalogram of Morlet Wavelet coefficients for scales 
101-200 of DJIA closing values indicating a cyclic behavior. 



During the GP optimization, one considers a pool of 
chromosomes. A chromosome represents a potential solu- 
tion. Evolution to successive generations is then carried 
out stochastically by applying genetic operators, namely 
copy, crossover and mutation. 

The sum of squared errors, 



i=N 

A 2 = J2(X? alc - X? iven ) 2 
i=i 



(4) 



is minimized, where N represents number of X t values 
(Eq. [3]) that are fitted during the GP optimization. 
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FIG. 16: Sum of wavelet coefficients for DJIA over all scales 
as a function of time. An approximate cyclic behavior with 
periodicity of a duration of little less than 200 trading days is 
evident. 



For a given chromosome, the lower the above sum of 
squared errors, the better is the fit generated and there- 
fore the corresponding chromosome fairs better chance of 
participating in further evolutionary process through its 
fitness measure. The fitness measure is derived from A 2 
and is defined as in Eq. [5J 



R z 



1 



A 2 



i—N 



(5) 



given 



x 



given\2 



where xf lven is the average of all Xi (Eq. [3]) to be fitted. 

It is observed that during the optimization process, in 
order to get higher and higher fitness measures, GP may 
lead to quite involved chromosome strings. In order to 
discourage GP to over fit by generating longer strings 
of chromosomes, the fitness measure is modified fl5j as 
follows, 



= 1 - (1 - R 2 



N - 1 



(6) 



where N is the number of equations to be fitted in the 
training set and k is the total number of time lagged 
variables of the form X t , X t - T , X t -2 T , ■■■ etc (including 
repetitions) occurring in the given chromosome. This 
modified fitness measure prefers a parsimonious model by 
generating crisp map equations for chromosomes. For R 2 
close to 0, the modified fitness measure r can be negative. 

The map equations generated by above GP prescrip- 
tion is then used to make out-of-sample predictions out- 
side the fitted set of data and the measure of goodness 
of predictions is ascertained by normalized mean square 
error(NMSE) as given by Eq. 



1 E£f(* 



NMSE 



X 



given \ 2 



(7) 



N variance of N data points 

On trying to model these wavelet coefficients, it is 
found that due to sharp variations, the GP optimiza- 
tion does not lead to convergence having good fitness 




FIG. 17: Db4 wavelet coefficients (wc) and Spline interpolated 
we for 6 th level for Nifty data. 



values. We have therefore found it necessary to smoothen 
these wavelet coefficients using an appropriate method. 
For all the wavelet coefficients of different levels consid- 
ered, we smoothen them using a cubic Spline interpo- 
lation method. We generate smoothened coefficients by 
incorporating 4 additional points which are sampled by 
cubic Spline method for each consecutive pair of points. 
The piecewise polynomial form of the smoothened data 
is well suited with a similar structure used for the map 
equation (Eq. [3]) searched by Genetic Programming. It 
is worth emphasizing that this procedure is appropriate 
since we are modeling cyclic behavior which are gener- 
ally smooth in nature as compared to sharp variations of 
transients. 

We now proceed with modeling of smoothened Db4 
forward wavelet coefficients using GP at level=6, 7 and 



Modeling variations in S&P CNX Nifty closing 
index 



As seen earlier cyclic and structured variations emerge 
at relatively higher scales. For modeling purpose, we 
consider the 6, 7 and 8 th level coefficients generated using 
Db4 forward wavelet transform. These coefficients show 
considerable cyclic fluctuations both at local and global 
scales. At 6 th level, bursty behavior is also seen. 

We have divided the values of these wavelet fluctua- 
tions by 1000 for the sake of computational convenience. 
The smoothened wavelet coefficients for level=6, 7 and 
8 are shown in Figs. [TTJ [TH] and Q33 along with original 
wavelet coefficients. 

The smoothened wavelet coefficients are then mod- 
elled using GP. We have used d=5 and r=l for these 
fits and the resulting fits are very good having fitness 
values 0.99499 (level=6), 0.99498 (level=7) and 0.997038 
(level=8). 

The map equations are shown in Eq. [5J [9] and [TTJ] 
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FIG. 18: Db4 wavelet coefficients (wc) and Spline interpolated 
we for 7 th level for Nifty data. 
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FIG. 20: Fit for 130 data points using GP solution for Db4 
level-6 wavelet coefficients for Nifty data. 




FIG. 19: Db4 wavelet coefficients (wc) and Spline interpolated 
wc for 8 th level for Nifty data. 
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FIG. 21: Fit for 55 data points using GP solution for Db4 
level-7 wavelet coefficients for Nifty data. 



(Level=e) 

K t+1 



= 1.9762X t + ■ 



2.5298X f 



x *-2t + 10 -° 
0.0006023X(_ 2t 



1.5627X t _ T + 0.3008X t _ 3T 



type rational terms, has rather small coefficients as com- 
pared to those for the linear terms. It is also noted that 
the significant departure of the obtained solutions from a 
persistent solution (X t +i=X t ) shows the efficacy of the 
GP optimization approach. In the context of specific lev- 



< t + V"' 1 7> ~ 2 - 814x t - 2.9401X t _ T + 1.1239X t _ 2 T - 0.01761X t _ 3t (9) 



= 1.5X t - 0.6522X t _ 2x 



0.368(X t _ T - 2.9X t _ 4x + 1.3103) 



The GP fit obtained by these equations are quite good 
and these are shown for level=6, 7 and 8 in Figs. [2Ql [21] 
and [221 respectively. 

The goodness of the fits is indicated by the small values 
of the differences between given and calculated values 
shown by the line close to 0.0. 

It is worth pointing out that the GP map equations 
representing cyclic variations are primarily found to be 
linear. The nonlinearity if any, arising from the Pade 
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FIG. 22: Fit for 18 data points using GP solution for Db4 
level-8 wavelet coefficients for Nifty data. 
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FIG. 23: Contributions of individual terms in the right hand 
side of Eq. [8] for Db4 level-6 wavelet coefficients for Nifty 
data. It is observed that the first and third terms are the 
dominant ones which give rise to the bursty behavior (Fig. 
I20[) due to their slight out of phase dynamics. 



els, we observe that the 6 th level variations show cyclic 
as well as bursty behavior. Interestingly, the 7 th level co- 
efficients show smooth variations; the corresponding GP 
equation (Eq. [9]) is completely linear. On the contrary 
the 8 th level variations show non-smooth variations which 
are not bursty like the 6 th level coefficients. 

In order to understand the interplay of different terms 
giving rise to the bursty behavior, we have computed the 
contributions arising from each term of Eq. [8] individu- 
ally. The same is shown in Fig. [23l It is clearly seen 
that the first and the third term are the dominant ones, 
which arc slightly out of phase from each other. The 
corresponding cancellation is responsible for the bursty 
behavior. The dynamical origin of these terms and their 
modeling is rather non-trivial, which needs significant in- 
vestigations. 

The map equations will now be used to carry out 1-step 
out-of-sample predictions beyond the fitted points. The 
predictions for level=6, 7 and 8 are shown in Figs. [24] 
1251 and [26] respectively with corresponding NMSE values 
as 0.04923, 0.03907 and 0.03946 respectively. It can be 
seen that the 1-step predictions are very good. 



B. Modeling variations in Dow Jones Industrial 
Average Closing values 

We next consider modeling of Db4 wavelet coefficients 
for Dow Jones Industrial Average (DJIA) closing values. 
Akin to the GP analysis for Nifty wavelet coefficients, we 
have found it useful to smoothen the wavelet coefficients 
using Cubic Spline for the purpose of GP modeling. This 
also makes it easier to compare the cyclic variations in 
the two time series considered 

The comparison of Db4 forward wavelet coefficients for 
level=6, 7 and 8 with the Cubic Spline smoothened coef- 
ficients are shown in Figs. [27] l28l and l29l 

The map equations generated by GP for d=5 and r=l 
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FIG. 24: Out-of-sample 1-step predictions using GP solution 
for Db4 level-6 wavelet coefficients for Nifty data. 
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FIG. 25: Out-of-sample 1-step predictions using GP solution 
for Db4 level-7 wavelet coefficients for Nifty data. 



are having fitness values 0.998013, 0.99721, and 0.99929 
and are shown in Eq. [Til EH an d HS1 
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FIG. 26: Out-of-sample 1-step predictions using GP solution 
for Db4 level-8 wavelet coefficients for Nifty data. 
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FIG. 27: Db4 wavelet coefficients (wc) and Spline interpolated 
wc for 6 th level Dow Jones data. 
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FIG. 28: Db4 wavelet coefficients (wc) and Spline interpolated 
wc for 7 th level Dow Jones data. 




FIG. 30: Fit of the GP solution for Db4 level-6 Dow Jones 
wavelet coefficients. The variations have a bursty character. 
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FIG. 31: Fit of the GP solution for Db4 level-7 Dow Jones 
wavelet coefficients. The amplitude of the cyclic variations is 
seen to increase with time. 
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FIG. 29: Db4 wavelet coefficients (wc) and Spline interpolated 
wc for 8 th level Dow Jones data. 
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(13) 



As seen in Figs. [HI ED and El for level=6, 7 and 8 
respectively, the GP fits are quite good. 

Similar to the Nifty analysis, the GP equations are 
primarily of linear type having non-linear terms of Pade 
type. Eq. [TT] is primarily linear. The effect of nonlinear- 
ity as seen from the Pade terms is different from the Nifty 
behavior. Eq. [H shows a very interesting behavior. If 
difference between two consecutive data points are small, 
then the Pade term gives a strong contribution, which de- 
creases as the differences increase. Eq. [T3] representing 
level 8 coefficients is again mostly linear. 
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FIG. 32: Fit of the GP solution for Db4 level-8 Dow Jones 
wavelet coefficients. The variations show a step like behavior. 
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FIG. 33: Out-of-sample 1-step predictions using GP solution 
for Db4 level-6 Dow Jones wavelet coefficients. 



We then use these map equations and carry out 1-step 
out-of-sample predictions beyond the fitted points. These 
predictions are found to be very good as is reflected from 
their small NMSE values, 0.001787 (level=6), 0.002379 
(level=7) and 0.0004981 (level=8). The predictions are 
shown for level=6, 7 and 8 in Figs. [33l [34] and [35] respec- 
tively and arc found to be excellent. 



IV. CONCLUSION 

In conclusion, we have illustrated a wavelet based ap- 
proach to separate stochastic and structured variations 
in non-stationary time series. Modeling different aspects 
e.g., fluctuations and trend of these time series is a chal- 
lenging task. It becomes particularly difficult when the 
fluctuations comprise of random, cyclic and transient 
variations at multiple scales. The fact that wavelet trans- 
form possesses multi-resolution analysis capability, has 
opened the way to isolate variations at different scales. 



We have taken advantage of this ability of wavelets to 
study and model cyclic variations of the financial time 
series, which are known to be non-stationary. Genetic 
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FIG. 34: Out-of-sample 1-step predictions using GP solution 
for Db4 level-7 Dow Jones wavelet coefficients. 
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FIG. 35: Out-of-sample 1-step predictions using GP solution 
for Db4 level-8 Dow Jones wavelet coefficients. 



programming models the cyclic behavior well through 
crisp dynamical equations. One step predictions have 
been carried through and these are found to be quite 
accurate. 

Apart from studying other physical time series, it will 
be nice to combine the present approach with random 
matrix based ones for the purpose of pinpointing emer- 
gence of cyclic behavior. As has been mentioned earlier, 
random matrix approach has indicated correlation be- 
tween group of companies in financial time series, which 
can lead to cyclic or structured variations apparent in the 
present analysis. Hence, it will be of deep interest to see 
if these two can be interrelated. 

Amit Verma is thankful to Physical Research Labora- 
tory for providing him a project traineeship during which 
part of this work was done. 
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